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^ Abstract 

(N 

A wavelet is a localized function having a prescribed number of 

'~P vanishing moments. In this correspondence, we provide precise argu- 

■^ ments as to why the Hilbert transform of a wavelet is again a wavelet. 

l^ In particular, we provide sharp estimates of the localization, vanishing 

(— I moments, and smoothness of the transformed wavelet. We work in the 

general setting of non-compactly supported wavelets. Our main result 
is that, in the presence of some minimal smoothness and decay, the 
^1 Hilbert transform of a wavelet is again as smooth and oscillating as the 

original wavelet, whereas its localization is controlled by the number 
— H of vanishing moments of the original wavelet. We motivate our results 

^ using concrete examples. 



ON 
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^ 1 Introduction 

^ It is known that the poor translation-invariance of standard wavelet bases can 

>■ be improved by considering a pair of wavelet bases, whose mother wavelets 

k> are related through the Hilbert transform [|8l [71 [TTl |4l . The advantages of 

;_i using Hilbert wavelet pairs for signal analysis had also been recognized by 

other authors [21 [H. More recently, it was shown in [3| how a Gabor-like 

wavelet transform could be realized using such Hilbert pairs. 

The fundamental reasons why the Hilbert transform can be seamlessly 
integrated into the multiresolution framework of wavelets are its scale and 
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translation invariances, and its energy-preserving (unitary) nature Q . These 
properties are at once obvious from the Fourier-domain definition of the 
transform. We recall that the Hilbert transform V.f{x) of a sufficiently well- 
behaved function f{x) is specified by 



-Hfiu;) 



-jf{uj) for w > 

+j/(w) for w < (1) 

atuj = 0. 



On one hand, the unitary nature ensures that the Hilbert transform of a 
(wavelet) basis of L^(R) is again a basis of L^(R). On the other hand, the 
invariances of scale and translation together provides coherence — the Hilbert 
transform of a wavelet basis generated from the mother wavelet il^ix) is 
simply the wavelet basis generated from the mother wavelet T-LiIj{x). 

The flip side, however, is that the transform is incompatible with scaling 
functions (low-pass functions in general), the building blocks of multires- 
olution analyses. As shown in Figure [T] the transform "breaks-up" scaling 
functions, resulting in the loss of their crucial approximation property. More- 
over, the transformed function exhibits a slow decay. Starting from a given 
multiresolution with associated wavelet basis {ipn)n&z, this presents concep- 
tual difficulties in realizing a dual multiresolution with basis {Hipn)n&z- It 
was shown in (3) that this pathology can, however, be overcome by a careful 
design of the dual multiresolution in which the Hilbert transform is applied 
only on the wavelet, and never explicitly on the scaling function. 

The above-mentioned pathologies can be explained by considering the 
space-domain definition of the transform, which is slightly more involved 
mathematically (see, e.g., [lT2l l6l): 

^/(x) = -lim / f{x-t)^. (2) 



Disregarding the technicalities involving the use of truncations and limits, 
V.f{x) is thus essentially given by the convolution of f{x) with the kernel 
l/vrx (cf Figure [2]). It is now readily seen that the above-mentioned observa- 
tions follow as a consequence of the "oscillating" form of the kernel and its 
slow decay at the tails. We will conveniently switch between definitions ([l]) 
and ([2]) in the sequel. 

Our main observation is that the Hilbert transform goes well with oscil- 
latory patterns, and wavelets in particular. The archetypal relation in this 
regard is its action on pure sinusoids, 

7i[cos{ojox)] = sin(a;o2;). 
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Figure 1: Scaling functions and their Hilbert transforms: (a) The discon- 
tinuous Haar scaling function (BLUE) and its transform (RED), (6) The 
smooth cubic B-spline (BLUE) and its transform (RED). In either case, the 
transformed function is "broken-up" and, as a consequence, loses its ap- 
proximation property. In particular, the transform no longer exhibits the 
partition-of-unity property, which is characteristic of scaling functions. Also, 
note the slow decay of the transform, particular for the smooth spline func- 
tion. In fact, both the transforms decay as l/|x| — the smoothness of the 
original function has no effect on the decay of the transform. 



Thus, the transform tends to preserve oscillations. The nature of the interac- 
tion with localized oscillations is suggested by the relation 

'H[ip{x)cos{uJox)]=if{x)sm{ujox) (3) 

which holds if the localization window (p{x) is bandlimited to (— wq, ^o) f2l|. 
This is an immediate consequence of definition ([T]). The crucial observation, 
however, is that the transformed signal is again smooth (in fact, infinitely 
differentiable) and oscillatory, and importantly, has the same localization 
as the input signal. It is known that a particular family of spline wavelets, 
namely, the B-spline wavelets [|T3ll . converge to a function of the form 
if{x) cos(wox + (f>) with the increase in the order of the spline. In particular, 
it was shown in [31 that the Hilbert transform has comparable localization, 
smoothness, and vanishing moments for sufficiently large orders (cf. Figure 
|3]). It was also shown that the transformed wavelet in fact approaches 
if{x) sin(i:i;o2; + (/>) as the order increases, which is consistent with (|3]). Since, 
more generally, wavelets with sufficient smoothness and vanishing moments 
can be made to closely approximate the form in (|3]), we could in fact arrive 
at a similar conclusion for a larger class of wavelets. 

Using these instances as guidelines, we attempt to answer the following 
basic questions in the sequel: 

• When is the Hilbert transform of a wavelet well-defined? In particular, 
how much smoothness and decay is required? 

• Why does the Hilbert transform of a wavelet exhibit better decay than 
the corresponding scaling function? How does one really get past the 
1/1 x| decay? 

• How good is the localization of the transformed wavelet, how smooth 
is it, and how many vanishing moments does it have? 

2 Notations 

The Fourier transform of f{x) is defined by f{uj) = /^ f{x) exp {—juix) dx. 
We omit the domain of integration when this is obvious from the context. 
We define ||/||i = /!"^|/(x)| dx, and ||/|U = sup{|/(x)| : x G R}. The 
notation Txf{t) denotes the function Txf{t) = f{x — t). We write f{x) = 
0{g{x)),x G A, to signify that |/(x)| < Cg{x) for all x E A, where C is 
an absolute constant. We denote the first derivative of f{x) by f'{x); in 
general, we denote the k-th derivative by f^^\x). We say that /(x) is n-times 




Figure 2: The convolution kernel of the Hillbert transform, I/ttx. It has a 
singularity at the origin and its tails decay slowly. The former pathology can 
be overcome provided that the signal on which the transform is applied is 
sufficiently smooth, while the slow decay can be overcome if the signal is of 
compact support, or at least, of sufficient decay. 

continuously differentiable if all its derivatives up to order n exists and are 
continuous. 

3 Main results 



The kernel l/vrx fails to be absolutely integrable owing to its slow decay 
and, more importantly, its singularity at the origin. The limiting argument 
in ([2]) avoids the singularity by truncating the kernel around the origin in a 
systematic fashion. The slow decay of the kernel, on the other hand, can be 
dealt with by simply restricting the domain of ([2]) to functions with sufficient 
decay. 



As noted in the introduction, the Hilbert transform goes well only with 
smooth functions. This can be readily appreciated by looking at the transform 
of the discontinuous Haar wavelet in Figure [s] In this case, the transform 
"blows-up" in the vicinity of the discontinuities, and is, in fact, not even 
well-defined at the points of discontinuity. The following result, which relies 
on some classical methods of harmonic analysis, explains how this problem 
can be fixed. 

For convenience, we introduce the mixed norm ||/||i,oo = ||/||i+||/'||cx) 
which measures both the local smoothness and the global size of f{x). 

Theorem 3.1 (The classical result). Let f{x) be a differentiable function such 
thatboth ||/||i,oo c^rid ||x/(-)||i,oo are finite. Then T-Lf{x) is well-defined, and 

\nf{x)\ < ^-^ (||/||i,oo + ||x/(-)||i,oo) = 0{\x\-'). (4) 

In particular, this holds true if f{x) is continuously differentiable and is 
of compact support. 

Proof. Consider the basic quantity 

W{f) = -\\m! f{t)^. (5) 



Note that V.f{x) = W{Txf). To begin with, we at least need to guarantee 
that W{f) is well-defined. Note that the integrand in ([5]) is the product 
of the bounded function 1/f (on |i| > e) and the integrable function f{t). 
Therefore, the integral is absolutely convergent for all e > 0. All we need to 
show is that the integral remains convergent as e — > 0. To this end, we split 
the integral in (|5]), and use the anti-symmetry of 1/i to write 

W{f) = - lim / fit) ^ + i / fit) ^ 

'^ "^^ J £<\t\<l t ^ ^|t|>l * 

1 ,. [ ^^, Jit)- fiO) , 1 /■ ^., , dt 
= -lim / Kie,t) '^ ' /"^ ' dt + - fit)-. 

Tre^Oj t ^ J\t\>l t 

where Kie,t) = 1 when e < |t| < 1 and zero otherwise. Clearly, the second 
integral is convergent. As for the first, note that since fix) is bounded, by 
the mean-value theorem, if(e,t)|(/(t) — /(0)/t| < ||/'||oo for t and for all 
e > 0. Therefore, by the dominated convergence theorem. 



lim /i^(£,i) M-^ dt<2\\f' 
e-^0 J t 



In particular, we conclude that W{f) is well-defined, and 

im/)l<^(2||/'||oo+||/||i). (6) 

Since Txf{t) has the same decay and smoothness as f{t), it is now immediate 
that T-Lf{x) is well-defined (pointwise), and that 

\nf{x)\ < -||T,/||i,oo = - (2||/'||oo+||/||i) . (7) 

vr TT 

Next, we note that 



xnf{x) = - lim /" (x - t)f{x -t)— + - [ fit) dt 

^^^Oy|t|>£ t TT J 

= ng{x) + ^l f{t)dt 



where g{x) = xf{x). Since 11(7111,00 is finite, T-Lg{x) is well-defined, and 

\'Hg{x)\<-{2\\g'\\^+\\g\\i). 

TT 

Therefore, 

\xnf{x)\ < - {2\\g'\\^+\\g\\i+\\f\U) . (8) 

TT 

Combining and ([8]), we obtain Q- □ 

We note that the main conclusions of the theorem are well-known results 
in harmonic analysis; e.g., see [12^, 'G']. Moreover, the assumptions under 



which we reproduce these results in Theorem 3.1 are on the conservative 
side. In fact, as can already be seen from our derivation, the transform 
remains well-defined if we replace the constraint ||/'||oo< oo by the weaker 
hypothesis of Lipschitz continuity, that is, if \f{x) — f{y)\ < C\x — y\ for some 
absolute constant C. Our goal here was to introduce some mathematical 
tools which we eventually use to prove our main result. 

3.1 Vanishing moments and decay 

The derivation of the Theorem |3.1| exposes the unfortunate fact that the 
poor l/|x| decay cannot be improved even if f{x) is required to be more 
smooth (cf transform of the cubic spline in Figure [l]), or have a better decay. 



However, it does suggest the following: liT-Lg{x) goes to zero as \x\ goes to 
infinity (which is the case ii g{x) is sufficiently nice), then 



lim x%f{x) = — f{x) dx. 
\x\ — 5>oo n J 



In particular, if f{x) has zero mean, then xT-Lf{x) goes to zero at infinity. 
Therefore, the decay oi T-Lf{x) must be better than l/\x\ in this case. This 
alludes to the connection between the zero-mean condition and the improve- 
ment in decay. To make this more precise, we consider the example of the 
Haar wavelet 

[+1 for - 1 <x <0 

1-1 for < X < 1. 
Let \x\ > 2. Since ip{x) has zero mean, we can write 

nijix) = - lim f ^^ dt 

vr e-^0 7|4_^|>^ x-t 
= - lim / ipit) I ] dt 

- lim / /^ ^ , dt. 



7r£^oy|j_^l>^ x{x-t) 
Now |x - t| > \x\/2 for |x| > 2, and t G [-1, 1]. Hence, 



2 /■! 1 



\'H^lJix)\ < 2 / I^V'WI dt < 



TT \X\ J-1 7r|x| 

Thus, while the Hilbert transform of the Haar scaling function decays only as 
1/ \x\, the transform of the Haar wavelet has a better decay of 1/ |xp. This is 
clearly seen by comparing the plots in Figures [T] and [3j 

We can now generalize the above observation by requiring that, for some 

n > 1, 

x''ip{x) dx = {0 < k <n). 



This vanishing moment property is in fact characteristic of wavelets, which are 
often parametrized by the number n (9) . The following result explains how 
higher vanishing moments can contribute to the increase in the decay of the 
Hilbert transform. The main idea is that the kernel of the Hilbert transform 
effectively behaves as I/ttx"-^^ in the presence of n vanishing moments. 

We use the augmented decay to compute the number of vanishing mo- 
ments of the transformed wavelet. 
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(a) Degree (Haar wavelet) 



(b) Degree 3 (Cubic spline wavelet) 



Figure 3: B-spline wavelets (shown in BLUE) and their Hilbert transforms 
(shown in RED). The wavelets are ordered (left to right) by increasing 
smoothness and vanishing moments; both are compactly supported. Notice 
how the decay of the Hilbert transform increases with the increase in vanish- 
ing moments — the transform of the cubic spline wavelet appears to have an 
almost identical localization. Moreover, it is as smooth as the original wavelet. 
It is shown in the text that, in the presence of some minimal smoothness, the 
Hilbert transform is as smooth and oscillating as the spline wavelet. 



Theorem 3.2 (Decay and vanishing moments). Let %Ij{x) be a differentiable 
wavelet having n vanishing moments. Also, assumethat ||V'||i,oo, ||2;""*"^'(/'(')lli,oo) 
and \\x'^ip{-)\\i are finite. Then T-Lip{x) is well-defined, and 

\nHx)\ < Y^^ (ll^lli.oo + ||x"+V(-)lli.oo + IkXOIIi) = o(|xr"-i). 

(9) 
Moreover, T-Lil){x) has n vanishing moments. 

Before proceeding to the proof, we make some comments. Note that, un- 
der the assumptions on the vanishing moments, ([9]) holds true for compactly 
supported wavelets provided it is continuously differentiable. This in fact is 
the case for the cubic spline wavelet shown in Figure |3| More generally, ^ 
holds if ip{x) is continuously differentiable, has n vanishing moments, and 
satisfies the mild decay conditions 

^(x) = 0(l/|xr+^+"), V'(a;) = 0(l/kr+^+'') (|x|^oo) 

where e and e' are arbitrarily small positive numbers. The significance of the 
above result is that by requiring 'ip{x) to have a large number of vanishing 
moments, we can effectively make T-Lip{x) as localized as ip{x). This had been 
observed qualitatively early on in connection with the wavelet localization of 
the Radon transform [10]. 

Now, we show that ([9]) is sharp, by considering the special case of B-spline 
wavelets. It is known that if iIj{x) is a B-spline wavelet of degree n — 1, then 
7iip{x) is again a (fractional) B-spline wavelet of the same degree, and hence 
has the same decay of l/lxl""*"^ Il3l, [|T4ll . This exactly what is predicted by 
([9]), since ip{x) is known to have n vanishing moments. 



Proof of Theorem \3.2\ It follows from Theorem 3.1 that 'HiIj{x) is well-defined, 
and that 

\mi;{x)\<^{2W\\^+U\\i). (10) 

As for the decay, fix any x away from zero, and let 

, , 1 t t"-i 



X x"^ 



It is clear that 



P{t)tp{t) dt = 0. 
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Using this, we can write 



7r£->o7|4_^l>^ x-t 

- lim / i;{t)( 

T^ e-^Oj\t-x\>e ^X-t 



P(i)) dt 



A simple computation shows that 

1 



X — t 



Pit) 



„n+l 



t" + 



m+l 



X — t 



SO that 



ni^ix) 



1 



TTX 
1 



71+1 



m+l 



Ht) 



n+l 



X 



f'lpit) dt + lim , 

<'^^J\t\-x>e x-t 

f^ipit) dt + ng{x) 



dt 



where g{x) = x'^^^ij){x). Form Theorem 3.1 and the assumptions on ^ipix), it 
follows that 

|x"+i?^V(x)|<^(||xX-)||i + 2||5'||oo + ||5lli). 

Combining this with ( [Tol ), we obtain (|9]). 

As for the vanishing moments of T-Lip{x), note that, since ip{x) has n 
vanishing moments. 



x^'ip{x)\ dx <oo {0 <k <n). 



One can then verify, e.g., using the dominated convergence theorem, that 
ip{u}) is n-times differentiable, and that 



V;W(0) = (-i)" / x'^i^ix) dx {0<k<n). 



(11) 



Therefore, '^(^^(O) = for < /c < n. 

Now, since ^p{x) is square-integrableQ ([l]) holds. It can then be verified 



{k), 



that T-L'ip{uj) is n-times differentiable, and that "HV (0) = for < k < n 



^This follows from the fact that f{x) is both integrable and bounded. The boundedness of 
f{x) is a consequence of its uniform continuity, which in turn follows from the boundedness 
of f'{x). Indeed, uniform continuity along with integrability implies that f{x) — > as 
X — y oo, and this along with continuity implies boundedness. 
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To arrive at the desired conclusion, we note that \T-Lij(x)\ < C/(l + Ixl""*"^), 
whereby 

\x'^'Hil){x)\ dx < oc {0 < k <n). 



/' 



This is sufficient to ensure that pip holds for 'H^{x), thus completing the 



proof. D 

Note that the specialized form of this result is well-know for the particular 
case of n = 1, that is, when the function is of zero mean. For example, 
along with the classical Calderon-Zygmund decomposition (a wavelet-like 
decomposition), this is used to derive certain boundedness properties of 
the transform on the class of integrable functions [161 ■ To the best of our 
knowledge, there is no explicit higher-order generalization of this result 
in the form of Theorem |3.2 in the harmonic analysis or signal processing 
literature. 

3.2 Smoothness 

We now investigate the smoothness oiT-L%lj{x). The route we take capitalizes 
on the Fourier-domain specification of the transform, and the fact that the 
smoothness of a function is related to the decay of its Fourier transform. In 
general, the better the decay of the Fourier transform, the smoother is the 
function, and vice versa. We recall that a finite-energy signal f{x) is said to 
belong to the Sobolev space W^'^(R), 7 > 0, if 

|2 



{I + \uj\y \ f {uj)Y (ko < 00 

The Sobolev embedding theorem asserts that every /(x) belonging to W^''''(R) 
can be identified (almost everywhere) with a function which is n-times con- 
tinuously differentiable provided that 7 > n + 1/2; e.g., see Il9]|. Since ([l]) 
holds true for all finite-energy signals, we immediately conclude that 

Proposition 3.3 (Comparable smoothness). Ifip{x) belongs to W^''>'(R), then 
7iip{x) belongs W^''''(R). In particular, if ^ > n + 1/2^ then both ip{x) and 
7iip{x) are n-times continuously differentiable (almost everywhere). 

For example, the cubic spline wavelet belongs to W^'''^(R) for all 7 < 
3 + 1/2 [14]. This explains the comparable smoothness of the wavelet 
and its transform shown in Figure [3| which are both twice continuously 
differentiable. 
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4 Conclusion 

It has been known for quite some time that the Hilbert transform of a wavelet 
is again a wavelet. In this correspondence, we were concerned with the 
precise understanding of the sense in which this holds true. In particular, we 
formulated certain basic theorems concerning the localization, smoothness, 
and the number of vanishing moments of the Hilbert transform of a wavelet. 
Our main objective was to provide self-contained and straightforward proofs 
of these results along with some concrete examples. 
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